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Modified Nucleotides 
FIELD OF THE INVENTION 

•The invention relates to modified nucleotides. 
In particular, this invention discloses nucleotides 
• having a removable blocking group and their use in 
polynucleotide sequencing methods, 

BACKGROUND 

Advances in the study of molecules have been led, 
in part, by improvement in technologies used to 
characterise the molecules or their biological 
reactions. In particular, the study of the nucleic 
acids DNA and RNA has benefited from developing 
technologies used for sequence, analysis and the study of 
hybridisation events. 

An example of the technologies that have improved 
the study of nucleic acids, is the development of 
fabricated arrays of immobilised nucleic acids. These 
arrays consist typically of a high-density matrix of 
polynucleotides immobilised onto a solid support 
material. See, e.g., Fodor et al., Trends Biotech. 
-12:19-26, 1994, which describes ways of assembling the 
nucleic acids using a chemically sensitized glass 
surface protected by a mask, but exposed at defined 
areas to allow attachment of suitably modified 
nucleotide phosphoramidites . Fabricated arrays can also 
be manufactured by the technique of " spotting" known 
polynucleotides onto a solid support at predetermined 
positions (e.g., Stimpson et al., Proc. Natl. Acad. Sci. 
USA 92:6379-6383, 1995). 

Sequencing by synthesis of DNA ideally requires the 
controlled (i.e. one at a time) incorporation of the 
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correct complementary nucleotide opposite the 
oligonucleotide being sequenced. This allows for 
accurate sequencing by adding nucleotides in multiple 
cycles as each nucleotide residue is sequenced one at a 
5 time, thus preventing an uncontrolled series of 
incorporations occurring. The incorporated nucleotide 
is read using an appropriate label attached thereto 
before removal of the label moiety and the subsequent 
next round of sequencing. In order to ensure only a 

10 single incorporation occurs, a reversible structural 
modification ("blocking group") of the sequencing 
nucleotides is required to ensure a single nucleotide 
incorporation but which then prevents any further 
nucleotide incorporation into the polynucleotide chain. 

15 The blocking group must then be removable, under 
reaction conditions which do not interfere with the 
integrity of the DNA being sequenced. The sequencing 
cycle can then continue with the incorporation of the 
next blocked, labelled nucleotide. In order to be of 

20 practical use, the entire process should consist of high 
yielding, highly specific chemical and enzymatic steps 
to facilitate multiple cycles of sequencing. 

SUMMARY OF THE INVENTION 

25 

In the present invention, a nucleoside or 
nucleotide molecule comprises a purine or pyrimidine 
base and a ribose or deoxyribose sugar moiety which has 
a removable 3 1 -OH blocking group covalently attached 

30 thereto, which renders the molecules useful in 
techniques requiring blocking of the 3 ' -OH group to 
prevent incorporation of additional nucleotides, such as 
for example in sequencing reactions, polynucleotide 
synthesis, nucleic acid amplification, nucleic acid 

35 hybridisation assays, single nucleotide polymorphism 
studies, and other such techniques. 



Therefore , according to a first aspect of the 
invention there is provided a modified nucleotide or 
nucleoside molecule comprising a purine or pyrimidine 
base and a ribose or deoxyribose sugar moiety having a 
removable 3 1 -OH blocking group covalently attached 
thereto, such that the 3» carbon atom has attached a 
group of the structure 

-O-X 

wherein X is any of -C(R') 2 -0-R, -C (R' ) 2 -N (R) 2 , - 
C(R') 2 -N(H)R, -C(R') 2 -S-R and -C(R') 2 ~F, 

wherein each R group is or is part of a removable 
protecting group; 

each R' is independently H or an alkyl; and wherein 
said molecule may be reacted to yield an intermediate in 
which each R group is exchanged for H or, where X is 
-C(R') 2 -F, the F is exchanged for OH, SH or NH 2 , pref- 
erably OH, which intermediate dissociates under aqueotis 
conditions to afford a molecule with a free 3 f 0H; 

with the proviso that where X is -C (R' ) 2 -S-R, both 
R' groups are not H. 

The intermediates produced advantageously 
spontaneously dissociate under aqueous conditions back 
to the natural 3 f hydroxy structure, which permits 
-further incorporation of another nucleotide Any 
appropriate protecting group may be used, as discussed 
hereinafter. Preferably, X is of formula -c(R') 2 -0-R, 
-C(R' ) 2 -N(R) 2 , -C(R' ) 2 -N(H) R and -C(R' ) 2 -SR. 

Particularly preferably, X is of the formula -C(R') 2 -0- 
R, -C(R' ) 2 -N(R) 2 , and -C(R') 2 -SR. One example of groups 
of structure -O-X wherein X is -C (R' ) 2 -N (R) 2 are those 
in which N(R) 2 is azido (-N 3 ) . One preferred such 
example is azidomethyl wherein each R' is H. R may also 
be a benzyl group or a substituted benzyl group in an 
alternative embodiment. 
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The molecule can be linked via the base to a 
detectable label by a desirable linker, which label may 
be a fluorophore, for example. The linker can be acid 
labile, photolabile or contain a disulphide linkage. 
5 Other linkers may be employed in this invention as 
described in greater detail below. 

The invention also features a method of controlling 
the incorporation of a nucleotide molecule complementary 

10 to the ' nucleotide in a target single stranded 
polynucleotide in a synthesis or sequencing reaction 
comprising incorporating into the growing complementary 
polynucleotide a molecule according to the invention, 
the incorporation of said molecule preventing or 

15 blocking introduction of subsequent nucleoside or 
nucleotide molecules into said growing complementary 
polynucleotide. The incorporation of the molecule may 
be accomplished via a terminal transferase, a polymerase 
or a reverse transcriptase. 

20 

Additionally, the invention also comprises a method 
for determining the sequence of a target single stranded 
polynucleotide comprising monitoring the sequential 
incorporation of complementary nucleotides, wherein the 

25 nucleotide is a molecule according to the invention as 
hereinbefore described that is linked to a detectable 
label via a cleavable linker and wherein the identity of 
each nucleotide molecule incorporated is determined by 
detection of the label linked to the base, and said 

30 blocking group and said label is removed prior to 
introduction of the next complementary nucleotide. In 
a preferred embodiment of this aspect of the invention, 
the blocking group and the label may be removed in a 
single chemical treatment step. Thus, in a preferred 

35 embodiment of the invention, the blocking group is 
chemically identical on each of the bases A,T,C and G 
and the block is cleaved simultaneously with the label. 
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Furthermore, preferably the blocked . and labelled 
modified nucleotide constructs of the nucleotide bases 
A,T,C and G are recognised as substrates by the same 
polymerase enzyme. 

In another aspect, the invention provides a method 
for determining the sequence of a target, single-stranded 
polynucleotide, comprising : 

(a) providing modified nucleotide molecules 
according to the hereinbefore described invention, 
linked by the base to a detectable label via a cleavable 
linker and wherein the detectable label linked to each 
type of nucleotide can be distinguished upon detection 
from the detectable label used for other types of 
nucleotides; 

(b) incorporating the nucleotide into the 
complement of the target single stranded polynucleotide; 

(c) detecting the label of the nucleotide of (b) , 
thereby determining the type of nucleotide incorporated; . 

(d) removing the label of the nucleotide of (b) 
and the blocking group; 

(e) optionally repeating steps (b)-(d) one or more 
times; 

thereby determining the sequence of a target 
single-stranded polynucleotide. In this aspect, the' 
label of the nucleotide and the blocking group are 
preferably removed in a single chemical treatment step. 
Thus, in a preferred embodiment of the invention, the 
blocking group is chemically identical on each of the 
bases A, T,C and G and the block is cleaved 
simultaneously with the label. Furthermore, preferably 
the blocked and labelled modified nucleotide constructs 
of the nucleotide bases A,T,C and G are recognised as 
substrates by the same polymerase enzyme. 

In the methods described herein, each of the 
nucleotides can be brought into contact with the target 
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sequentially, with removal of non-incorporated 
nucleotides prior to addition of the next nucleotide, 
where detection and removal of the label and the 
blocking group is carried out either after addition of 
5 each nucleotide, or after .addition of all four 
nucleotides . 

In the methods, all of the nucleotides can be 
brought into contact with the target simultaneously, 
10 i.e-, a composition comprising all of the different 
nucleotides is brought into contact with the target, and 
non-incorporated nucleotides are removed prior to 
detection and subsequent to removal of the label and the 
blocking group. 

15 

The methods can comprise a first step and a second 
step, where in the first step, a first composition 
comprising two of the four types of modified nucleotides 
is brought into contact with the target, and non- 
20 incorporated nucleotides are removed prior to detection 
and subsequent to removal of the label and blocking 
group, and where in the second step, a second 
composition comprising the two nucleotides not included 
in the first composition is brought into contact with 
25 the target, and non-incorporated nucleotides are removed 
prior to detection and subsequent to removal of the 
label and blocking group, and where the first steps and 
the second step can be optionally repeated one or more 
times - 

30 

The methods described herein can also comprise a 
first step and a second step, where in the first step, 
a composition comprising one of the four nucleotides is 
brought into contact with the target, and non- 
35 incorporated nucleotides are removed prior to detection 
and subsequent to removal of the label and blocking 
group, and where in the second step, a second 



composition comprising the three nucleotides not 
included in the first composition is brought into 
contact with the target, and . non-incorporated 
nucleotides are removed prior to detection and. 
subsequent to removal of the label and blocking group, 
and where the first steps and the second step can be 
optionally repeated one or more times. 

The methods described herein can also comprise a 
first step and a second step, where in the first step, 
a first composition comprising three of the four 
nucleotides is brought into contact with the target, and 
non-incorporated nucleotides are removed prior to 
detection and subsequent to removal of the label and 
blocking group and where in the second step, a 
composition comprising the nucleotide not included in 
the first composition is brought into contact with the 
target, and non-incorporated nucleotides are removed 
prior to detection and subsequent to removal of the 
label and blocking group, and where the first steps and 
the second step can be optionally repeated one or more 
times . 

In a further aspect, the invention features a kit, 
where the kit includes: (a) individual nucleotides 
according to the hereinbefore described invention, where 
each nucleotide preferably has a base that is linked to 
a detectable label via a cleavable linker, and where the 
detectable label linked to each nucleotide can be 
distinguished upon detection from the detectable label 
used for other three nucleotides; and (b) packaging 
materials therefor. The kit can further include an 
enzyme for incorporating the nucleotide into the 
complementary nucleotide chain and buffers appropriate 
for the action of the enzyme in addition to appropriate 
chemicals for removal of the blocking group and the 
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detectable label, which can preferably be removed by the 
same chemical treatment step. 

The nucleotides/nucleosides are suitable for use in 
5 many different DNA-based methodologies, including DNA 
synthesis and DNA sequencing protocols, including those 
involving single molecular arrays, particularly those 
disclosed in International Patent Application 
Publication No. WO00/06770. 

10 

According to another aspect of the invention, a 
method for determining the sequence of a target 
polynucleotide comprises monitoring the sequential 
incorporation of complementary - modified nucleotides 

15 according- to the invention, wherein the nucleotides 
comprise a detectable label linked to the base portion 
of the nucleotide via a cleavable linker, incorporation 
being detected by monitoring the label, and the label 
and blocking groups are removed to permit further 

20 nucleotide incorporation to occur. 

DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a schematic illustration of the blocking 
25 groups that can be used according to the invention. 

Fig. 2 shows examplary nucleotide structures useful 
in the invention. For each structure, X can be H, 
phosphate, diphosphate or triphosphate. R x and R 2 can 

30 be the same or different, and can be selected from H, 
OH, or any group which can be transformed into an OH, 
including, but not limited to, a carbonyl . Some 
suitable functional groups for R x and R 2 include the 
structures shown in Fig. 4. 

35 Fig. 3 shows structures of linkers useful in the 

invention , including (1) disulfide linkers and acid 
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labile linkers, (2) dialkoxybenzyl linkers, (3) Sieber 
linkers, (4) indole linkers and (5) t-butyl Sieber 
linkers . 

Fig. 4 shows some functional molecules useful in 
the- invention, including some cleavable linkers and some 
suitable hydroxyl protecting groups. In these 
structures, R x and R 2 may be the same of different, and 
can be H, OH, or any group which can be transformed into 
an OH group, including a carbonyl. R 3 represents one or 
more substituents independently selected from alkyl, 
alkoxyl, amino or halogen groups. Alternatively, 
cleavable linkers may be constructed from any labile 
functionality used on the 3' -block. R 4 and R 5 can be H 
or alkyl, and R 6 can be alkyl,- cycloalkyl, alkenyl, 
cycloalkenyl or benzyl. X can be H, phosphate, 
diphosphate or triphosphate. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to nucleotide or 
nucleoside molecules that are modified by the reversible 
covalent attachment of a 3' OH blocking group thereto, 
and which molecule may be used in reactions where 
blocked nucleotide or nucleoside molecules are required, 
such as in sequencing reactions, polynucleotide 
synthesis and the like. 

Nucleotide, and more usually nucleotide 
triphospates, generally require a 3 1 OH blocking group 
so as to prevent the polymerase used to incorporate it 
into a polynucleotide chain, from continuing to 
replicate once the base on the nucleotide is added. 
There are many limitations on the suitability of a 
molecule as a blocking group. It must be such that it 
results in complete blockage of additional nucleotide 
molecules to the polynucleotide chain whilst 
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simultaneously being easily removable from the sugar 
moiety without causing damage to the polynucleotide 
chain. Furthermore, the modified nucleotide must be 
tolerated by the polymerase or other appropriate enzyme 
used to incorporate it into the polynucleotide chain. 
The ideal blocking group will therefore exhibit long 
term stability, efficient incorporation- by the 
polymerase enzyme, total blocking of secondary or 
further incorporation and the ability to be removed 
under mild conditions that do not cause damage the 
polynucleotide structure, such as under aqueous 
conditions. These stringent requirements are formidable 
obstacles to the design and synthesis of the requisite 
modified nucleotides. 

Reversible blocking groups for this purpose have 
been described previously but none of them generally 
meet the above criteria for DNA compatable chemistry. 



Protecting groups which comprise the acetal 
functionality have been used previously as blocking 
groups. However, removal of such groups, e.g. ethers 
requires strongly acidic deprotections detrimental to 
the DNA molecule. -The hydrolysis of an acetal however, 
25 results in the formation of an unstable hemiacetal 
intermediate which hydrolyses under aqueous conditions 
• to the natural hydroxyl group. The inventors have 
utilised this concept and applied it further such that 
the invention resides in utilising blocking groups that 
include protecting groups to protect intermediate 
molecules that would normally hydrolyse under aqueous 
conditions. These protecting groups comprise a second 
functional group that stabilises the structure of the 
intermediate but which can be removed at a later stage 
following incorporation into the polynucleotide. 
Protecting groups have been used in organic synthesis 
reactions to temporarily mask the characteristic 
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chemistry of a functional group because it interferes 
with another reaction. 

As is known in the art, a ''nucleotide" consists of 
a nitrogenous base, a sugar, and one or more phosphate 
groups. They are monomeric units of a nucleic acid 
sequence. In RNA, the sugar is a ribose, and in DNA a 
deoxyribose, i.e. a sugar lacking a hydroxyl group that 
is present in ribose. The nitrogenous base is a 
derivative of purine or pyrimidine. The purines are 
adenine (A) and guanine (G) , and the pyrimidines are 
cytosine (C) and thymine (T) (or ' in the context of RNA, 
uracil (U) ) . The C-l atom of deoxyribose is bonded to 
N-l of a pyrimidine or N-9 of a purine. A nucleotide is 
also a phosphate ester or a nucleoside, with 
esterif ication occurring on the hydroxyl group attached 
to C-5 of the sugar. Nucleotides are usually mono, di~ 
or triphosphates. 

A "nucleoside" is structurally similar to a 
nucleotide, but is missing the phosphate moieties. An 
example of a nucleoside analog would be one in which the 
label is linked to the base and there is no phosphate 
group attached to the sugar molecule. 

Although the base is usually referred to* as ' a 
purine or pyrimidine, the skilled person will appreciate 
that derivatives and analogs are available which do not 
alter the capability of the nucleotide or nucleoside to 
undergo Watson-Crick base pairing. "Derivative" or 
"analog" means a compound or molecule whose core 
structure is the same as, or closely resembles that of, 
a parent compound, but which has a chemical or physical 
modification, such as a different or additional side 
groups, which allows the derivative nucleotide or 
nucleoside to be linked to another molecule. For 
example, the base can be a deazapurine. The derivatives 



should be capable of undergoing Watson-Crick pairing. 
"Derivative" and "analog" also mean a synthetic 
nucleotide or nucleoside derivative having modified base 
moieties and/or modified sugar moieties. Such 
derivaties and analogs are discussed in, e.g., Scheit, 
Nucleotide Analogs (John wiley & Son, 1980) and Uhlman 
et al., Chemical Reviews 90:543-584, 1990. Nucleotide 
analogs can also comprise modified phosphodiester 
linkages, including phosphorothioate, 

phosphorodithioate, alkylphosphonate, phosphoranilida-t'e 
and phosphoramidate linkages. The analogs should be 
capable of undergoing Watson-Crick base pairing. 
"Derivative" and "analog", as used herein, may be used 
interchangeably, and are encompassed* by the terms 
"nucleotide" and "nucleoside" as defined herein. 

In the context of the present invention, the term 
"incorporating" means becoming part of a nucleic acid 
(eg DNA) molecule or oligonucleotide or primer. An 
oligonucleotide refers to a synthetic or natural 
molecule comprising a covalently linked sequence of 
nucleotides which are formed by a phosphodiester bond 
between the 3 1 position of the pentose on one nucleotide 
and the 5' position of the pentose on an adjacent 
nucleotide . 

The blocking groups used in accordance with the 
invention are shown schematically in Figure 1, and 
generally concern protected hemiacetals, protected 
hemiaminals and protected hemithioacetals as reversible 
blocking groups. In a preferred embodiment the 3 'OH 
group is blocked with, for example, an azidomethyl group 
as a protected form of a hemiaminal or 3,4- 
dimethoxybenzyloxymethyl group as a protected form of a 
hemiacetal. However, other suitable protecting groups 
may well be used as disclosed in Green & Wuts, 
Protective Groups in Organic Synthesis, John wiley & 



Sons . 



The term "alkyl" covers straight chain, branched 
chain and cyclic chain alkyl groups. Unless the context 
indicates otherwise, the term "alkyl" refers to groups 
having 1 to 8 carbon atoms, and typically from 1 to 6 
carbon atoms, for example from 1 to 4 carbon atoms. 
Examples of alkyl groups include methyl, ethyl, propyl, 
isopropyl, n-butyl, isobutyl, tert-butyl, n-pentyl, 2- 
pentyl, 3-pentyl, 2 -methyl butyl, 3-methyl butyl, and n- 
hexyl and its isomers . 

Examples of cycloalkyl groups are those having from 
3 to 10 ring' atoms, particular examples including those 
derived from cyclopropane, cyclobutane, cyclopentane, 
cyclohexane and cycloheptane, bicycloheptane and 
decalin. 

The term amino refers to groups of type NR X R 2 , 
wherein R 1 and R 2 are independently selected from 
hydrogen, a alkyl group (also referred to as C x _ e 

alkylamino or di-C^.g alkylamino) . 

The term "halogen" as used herein includes 
fluorine, chlorine, bromine and iodine. 

The nucleotide molecules of the present invention 
- are suitable for use in many different methods where the 
detection of nucleotides is required. 

DNA sequencing methods, such as those outlined in 
U.S. Pat. No. 5,302,509 can be carried out using the 
nucleotides . 

The present invention can make use of conventional 
detectable labels. Detection can be carried out by any 
suitable method, including fluorescence spectroscopy or 
by other optical means. The preferred label is a 
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fluorophore, Which, after absorption of energy, emits 
radiation at a defined wavelength. Many suitable 
fluorescent labels are known. For example, Welch et al. 
(Chem. Eur. J. 5 (3) : 951-960, 1999) discloses dansyl- 
5 functionalised fluorescent moieties that can be used in 
the present invention. Zhu et al. (Cytometry 28:206- 
211, 1997) describes the use of the fluorescent labels 
Cy3 and Cy5, which can also be used in the present 
invention. Labels suitable for use are also disclosed 

10 in Prober et al. (Science 238:336-341, 1987); Connell "et 
al. (BioTechniques 5 ( 4 ) : 342-384 , 1987), Ansorge et al. 
(Nucl. Acids Res. 15 ( 11 ): 4593-4 602 , 1987) and Smith et 
al. (Nature 321:674, 1986). Other commercially 

available fluorescent labels include, but are not 

15 limited to, fluorescein, rhodamine (including TMR, texas 
red and Rox) , alexa, bodipy, acridine, coumarin, pyrene, 
benzanthracene and the cyanins . 

Multiple labels can also be used in the invention. 
For example, bi-f luorophore FRET cassettes (Tet. Letts . 
20 46:8867-8871, 2000) are well known in the art and can be 
utilised in the present invention. Multi-fluor 
dendrimeric systems (J. Amer. Chem. Soc. 123:8101-8108, 
2001) can also be used. 

Although fluorescent labels are preferred, other 
25 forms of detectable labels will be apparent as useful to 
those of ordinary skill. For example, microparticles, 
including quantum dots (Empodocles, et al., Nature 
399:126-130, 1999), gold nanoparticles (Reichert et al., 
Anal. Chem. 72:6025-6029, 2000) and microbeads (Lacoste 
30 et al., Proc. Natl. Acad. Sci USA 97 (17) : 9461-9466, 
2000) can all be used. 

Multi-component labels can also be used in the 
invention. A multi-component label is one which is 
dependent on the interaction with a further compound for 
35 detection. The most common multi-component label used 
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in biology is the biotin-streptavidin system. Biotin is 
used as the label attached to the nucleotide base. 
Streptavidin is then added separately to enable 
detection to occur. Other multi-component systems are 
available. For example, dinitrophenol has a 

commercially available fluorescent antibody that can be 
used for detection. 

The invention will be further described with 
reference to nucleotides. However, unless indicated 
otherwise, the reference to nucleotides is also intended 
to be applicable to nucleosides. The invention will 
also be further described with reference to DNA, 
although the description will also be applicable to RNA, 
PNA, and other nucleic acids, unless otherwise 
indicated. 

The modified nucleotides of the invention use a 
cleavable linker to attach the label to the nucleotide. 
The use of a cleavable linker ensures that the label 
can, if required, be removed after detection, avoiding 
any interfering signal with any labelled nucleotide 
incorporated subsequently. 

Cleavable linkers are known in the art, and 
conventional chemistry can be applied to attach a linker 
to a nucleotide base and a label. The linker can be 
cleaved by any suitable method, including exposure to 
acids, bases, nucleophiles, electrophiles, radicals, 
metals, reducing or oxidising agents, light, 
temperature, enzymes etc. Suitable linkers can be 
adapted from standard chemical blocking groups, as 
disclosed in Greene & Wuts, Protective Groups in Organic 
Synthesis, John Wiley & Sons. Further suitable 

cleavable linkers used in solid-phase synthesis are 
disclosed in Guillier et al . (Chem. Rev. 100:2092-2157, 
2000) . 
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The use of the term "cleavable linker" is not meant 
to imply that the whole linker is required to be removed 
from the nucleotide base. The cleavage site can be 
located at a position on the linker that ensures that 
5 part of the linker remains attached to the nucleotide 
base after cleavage. 

The linker can be attached at any position on the 
nucleotide base provided that Watson-Crick base pairing 
can still be carried out. In the context of purine 

10 bases, it is preferred if the linker is attached via the 
7 position of the purine or the preferred deazapurine 
analogue, via an 8-modified purine, via an N-6 modified 
adenosine or an N-2 modified guanine. For pyrimidines, 
attachment is preferably via the 5 position on cytosine, 

15 thymidine or uracil and the N-4 position on cytosine. 

Suitable nucleotide structures are shown in Fig. 2. For 
each structure in Fig. 2, X can be H, phosphate, 
diphosphate or triphosphate. R x and R 2 can be the same 
or different, and can be selected from H, OH, or any 

20 group which can be transformed into an OH, including, 
but not limited to, a carbonyl. Some suitable 
functional groups for R x and R 2 include the structures 
shown in Fig. 4. 

Suitable linkers are shown in Fig. 3 and include, 
25 but are not limited to, disulfide linkers (1), acid 
'labile linkers (2, 3, 4 and 5; including dialkoxybenzyl 
linkers (e.g., 2), Sieber linkers (e.g., 3), indole 
linkers (e.g., 4), t-butyl Sieber linkers (e.g., 5)), 
electrophilically cleavable linkers, nucleophilically 
30 cleavable linkers, photocleavable linkers, cleavage 
under reductive conditions, oxidative conditions, 
cleavage via use of safety-catch linkers, and cleavage 
by elimination mechanisms. 
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A. Electrophilically cleaved linkers. 

Electrophilically cleaved linkers are typically 
cleaved by protons and include cleavages sensitive to 
acids- Suitable linkers include the modified benzylic 
systems such as trityl, p-alkoxybenzyl esters and p- 
alkoxybenzyl amides. Other suitable linkers include 
tert-butyloxycarbonyl (Boc) groups and the acetal 
system. 

The use of thiophilic metals, such as nickel, 
silver or mercury, in the cleavage of thioacetal or 
other sulphur-containing protecting groups can also be 
considered for the preparation of suitable linker 
molecules. 

B. Nucleophilically cleaved linkers. 

Nucleophilic cleavage is also a well recognised 
method in the preparation of linker molecules. Groups 
such as esters that are labile in water (i.e., can be 
cleaved simply at basic pH) and groups that are labile 
to non-aqueous nucleophiles, can be used. Fluoride ions 
can be used to cleave silicon-oxygen bonds in groups 
such as triisopropyl silane (TIPS) or t-butyldimethyl 
silane (TBDMS) . 

C. Photocleavable linkers. 

Photocleavable linkers have been used widely in 
carbohydrate chemistry. It is preferable that the light 
required to activate cleavage does not affect the other 
components of the modified nucleotides. For example, if 
a fluorophore is used as the label, it is preferable if 
this absorbs light of a different wavelength to that 
required to cleave the linker molecule. Suitable 
linkers include those based on O-nitrobenyl compounds 
and nitroveratryl compounds. Linkers based on benzoin 
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chemistry can also be used (Lee et al. , J- Org. Chem. 
64:3454-3460, 1999) . 

D. Cleavage under reductive conditions 

There are many linkers known that are susceptible 
5 to reductive cleavage. Catalytic hydrogenation using 
palladium-based catalysts has beei? used to cleave benzyl 
and benzyloxycarbonyl groups. Disulphide bond reduction 
is also known in the art. 

E. Cleavage under oxidative conditions 

10 Oxidation-based approaches are well known in the 

art. These include oxidation of p-alkoxybenzyl groups 
and the oxidation of sulphur and selenium linkers. The 
use of aqueous iodine to cleave disulphides and other 
sulphur or selenium-based linkers is also within the 

15 scope of the invention. 

F. Safety-catch linkers 

Safety-catch linkers are those that cleave in two 
steps. In a preferred system the first step is the 
generation of a reactive nucleophilic center followed by 
20 a second step involving an intra-molecular cyclization 
that results in cleavage. For example, levulinic ester 
linkages can be treated with hydrazine or photochemistry 
■ to release an active amine, which can then be cyclised 
to cleave an ester elsewhere in the molecule (Burgess et 
25 al., J. Org. Chem. 62:5165-5168, 1997). 

G. Cleavage by elimination mechanisms 

Elimination reactions can also be used. For 
example, the base-catalysed elimination of groups such 
as Fmoc and cyanoethyl, and palladium-catalysed 
30 reductive elimination of allylic systems, can be used. 




As well as the cleavage site, the linker can 
comprise a spacer unit. The spacer distances the 
nucleotide base from the cleavage site or label- The 
length of the linker is unimportant provided that the 
5 label is held a sufficient distance from the nucleotide 
so as not to interfere with any interaction between the 
nucleotide and an enzyme. 

In a preferred embodiment the linker may consist of 
the same functionality as the block. This will make the 
10 deprotection and deblocking process more efficient, as 
only a single treatment will be required to remove both 
the label and the block. 

A method for determining the sequence of a target 
polynucleotide can be carried out by contacting the 

15 target polynucleotide separately with the different 
nucleotides to form the complement to that of the target 
polynucleotide, and detecting the incorporation of the 
nucleotides. Such a method makes use of polymerisation, 
whereby a polymerase enzyme extends the complementary 

20 strand by incorporating the correct nucleotide 
complementary to that on the target. The polymerisation 
reaction also requires a specific primer to initiate 
polymerisation. 

For each cycle, the incorporation of the modified 
25 * nucleotide is carried out by the polymerase enzyme, and 
the incorporation event is then determined. Many 
different polymerase enzymes exist, and it will be 
evident to the person of ordinary skill which is most 
appropriate to use. Preferred enzymes include DNA 
30 polymerase I, the Klenow fragment, DNA polymerase III, 
T4 or T7 DNA polymerase, Taq polymerase or vent 
polymerase. A polymerase engineered to have specific 
properties can also be used. 



The sequencing methods are preferably carried out 



1 
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with the target polynucleotide arrayed on a solid 
support. Multiple target polynucleotides can be 
immobilised on the solid support through linker 
molecules, or can be attached to particles, e.g., 
5 microspheres, which can also be attached to a solid 
support material. 

The polynucleotides can be attached to the solid 
support by a number of means, including the use of 
biotin-avidin interactions. Methods for immobilizing 

10 polynucleotides on a solid support are well known in the 
art, and include lithographic techniques and "spotting" 
individual polynucleotides in defined positions on a 
solid support. Suitable solid supports are known in the 
art, and include glass slides and beads, ceramic and 

15 silicon surfaces and plastic materials. The support is 
usually a flat surface although microscopic beads 
(microspheres) can also be used and can in turn be 
attached to another solid support by known means. The 
microspheres can be of any suitable size, typically in 

20 the range of from 10 nm to 100 nm in diameter. In a 
preferred embodiment, the polynucleotides are attached 
directly onto a planar surface, preferably a planar 
glass surface. Attachment' will preferably be by means 
of a covalent linkage. Preferably, the arrays that are 

25 used are single molecule arrays that comprise 
polynucleotides in distinct optically resolvable areas, 
e.g., as disclosed in International App. No. WO 
00/06770. 

The sequencing method can be carried out on both 
30 single polynucleotide molecule and multi-polynucleotide 
molecule arrays, i.e., arrays of distinct individual 
polynucleotide molecules and arrays of distinct regions 
comprising multiple copies of one individual 
polynucleotide molecule. Single molecule arrays allow 
35 each individual polynucleotide to be resolved 
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separately. The use of single molecule arrays is 
preferred. Sequencing single molecule arrays non- 
destructively allows a spatially addressable array to' be 
formed. 

5 The method makes use of the polymerisation reaction 

to generate the complementary sequence of the target. 
The conditions necessary for polymerisation to occur 
will be apparent to the skilled person. 

To carry out the polymerase reaction it will 
10 usually be necessary to first anneal a primer sequence 
to the target polynucleotide, the primer sequence being 
recognised by the polymerase enzyme and acting as an 
initiation site for the subsequent extension of the 
complementary strand. The primer sequence may be added 
15 as a separate- component with respect to the target 
polynucleotide. Alternatively, the primer and the 
target polynucleotide may each be part of one single 
stranded molecule, with the primer portion forming an 
intramolecular duplex with a part of the target, i.e., 
2 0 a hairpin loop structure. This structure may be 
immobilised to the solid support at any point on the 
molecule. Other conditions necessary for carrying out 
the polymerase reaction, including temperature, pH, 
buffer compositions etc., will be apparent to those 
. 25 skilled in the art. 

The modified nucleotides of the invention are then 
brought into contact with the target polynucleotide, to 
allow polymerisation to occur. The nucleotides may be 
added sequentially, i.e., separate addition of each 
30 nucleotide type (A, T, G or C) , or added together. If 
they are added together, it is preferable for each 
' nucleotide type to be labelled with a different label. 
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This polymerisation step is allowed to proceed for 
a time sufficient to allow incorporation of a 
nucleotide . 

Nucleotides that are not incorporated are then 
5 removed, for example, by subjecting the array to a 
washing step, and detection of the incorporated labels 
may then be carried out. 

Detection may be by conventional means, for example 
if the label is a fluorescent moiety, detection of an 

10 incorporated base may be carried out by using a confocal 
scanning microscope to scan the surface of the array 
with a laser, to image a fluorophore bound directly to 
the incorporated base. Alternatively, a sensitive 2-D 
detector, such as a charge-coupled detector (CCD), can 

15 be used to visualise the individual signals generated. 

However, other techniques such as scanning near-field 
optical microscopy (SNOM) are available and may be used 
when imaging dense arrays. For example, using SNOM, 
individual polynucleotides may be distinguished when 

20 separated by a distance of less than 100 nm, e.g., 10 nm 
to 10 pm. For a description of scanning near-field 
optical microscopy, see Moyer et al., Laser Focus World 
29:10, 1993. Suitable apparatus used for imaging 
polynucleotide arrays are known and the technical set-up 

25 will be apparent to the skilled person. 

After detection, the label may be removed using 
suitable conditions that cleave the linker and the 3 f 
block to allow for incorporation of further modified 
nucleotides of the invention. 



30 
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EXAMPLES 



1. 3 ! -0H protected with an azidomethyl group as a 
protected form of a hemiaminal : 




Nucleotides bearing this blocking group at the 
3 1 position have been shown to be successfully 
incorporated by a number of different polymerases block 
efficiently and may be subsequently removed under 
10 neutral, aqueous conditions using water soluble 
phosphines or thiols allowing further extension: 



Al- 



ls 



P(m-PhS03Na) 3 / H 2 6 / 50°C / 1 5mins 



100% 



Stable 



Incorporated 
by enzymes 
Efficient blocking 
observed 



20 



intermediate 
-spontaneously 
degrades to product 



OH 



De-blocked 3'-OH 

ready for next incorporation 



« 
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2 . 3 1 -OH protected with a 3, 4-dimethoxybenzyloxymethyl 
group as a protected form of a hemiacetal: 




OMe 
OMe 



Nucleotides bearing this blocking group have similar 
properties to the above example-, though incorporate less 
rapidly. Deblocking can be achieved efficiently by the 
use of aqueous buffered cerium ammonium nitrate or DDQ, 
10 both conditions initially liberating the hemiacetal (1) 
which decomposes to the required (2) prior to further 
extension: 



15 



^OH ~ OH 

(1) (2) 



The 3' -OH may also be protected with benzyl groups 
where the phenyl group is unsubstituted, e.g. with 
benzyloxymethyl, as well as benzyl groups where the 
phenyl group bears electron-donating substituents ; an 
2 0 example of such an electron-rich benzylic protecting 
group is 3 , 4-dimethoxybenzyloxymethyl . 

In contrast, electron-poor benzylic protecting 
groups, such as those in which the phenyl ring is 
substituted with one or more nitro groups, are less 

25 preferred since the conditions required to form the 
intermediate groups of formulae -C(R') 2 -° H r -C(R') 2 -NH 2 , 
and -C(R') 2 -SH are sufficiently harsh that the integrity 
of the polynucleotide can be affected by the conditions 
needed to deprotect such electron-poor benzylic 

30 protecting groups. 
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3. 3' -OH protected with a f luoromethyloxymethyl group 
as a protected form of a hemiacetal: 

-0-CH 2 -F 

Nucleotides bearing this blocking group may be converted 
to the intermediate hemiacetal using catalytic reactions 
known to those skilled in the art such as , for example, 
those using heavy metal ions such as silver. 
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Claims 

1. A modified nucleotide or nucleoside molecule 
comprising a purine or pyrimidine base and a ribose or 
deoxyribose sugar moiety having a removable 3 ' -OH 
5 blocking group covalently attached thereto, such that 
the 3 ! carbon atom has attached a group of the structure 



-0-X 



10 wherein X is any of -C(R' ) 2 -0-R, -C (R/ ) 2 -N (R) 2 , ~ 

C(R' ) 2 -N(H)R, -C(R' ) 2 -S-R and -C(R') 2 -F, 

wherein each R group is' or is part of a removable 
protecting group; 

each R' is independently H or an alkyl; and wherein 
15 said molecule may be reacted to yield an intermediate in 
which each R group is exchanged for H or, where X is 

-C(R') 2 - F > the F is exchanged for OH, which intermediate 
dissociates under aqueous conditions to afford a 
molecule with a free 3'OH; 

20 with the proviso that where X is -C(R') 2 -S-R, both 

• R' groups are not H. 

2. A molecule according to claim 1 or 2, wherein R is 
a benzyl or substituted benzyl group. 

25 

3. A molecule according to claim 1 wherein X is an 
azidomethyl group. 



4 . A molecule according to any preceding claim wherein 
said base is linked to a detectable label via a 
cleavable linker. 



5. A molecule according to claim 4, wherein said 
detectable label is a fluorophore. 



6. A molecule according to claim 4 or 5 wherein said 
linker is acid labile, photolabile or contains a 
disulphide linkage . 

7. A method of controlling the incorporation of a 
nucleotide molecule complementary to the nucleotide in 
a target single stranded polynucleotide in a synthesis 
or sequencing reaction comprising incorporation into the 
growing complementary polynucleotide a molecule 
according to any of claims 1 to 6 the incorporation of 
said molecule preventing or blocking introduction of 
subsequent nucleoside or nucleotide molecules into said 
growing complementary polynucleotide. 



8. A method according to claim 7, wherein the 
incorporation of said molecule is accomplished by a 
terminal transferase or polymerase or a reverse 
transcriptase . 



9. A method for determining the sequence of a target 
single stranded polynucleotide, comprising monitoring 
the sequential incorporation of complementary 
nucleotides, wherein the nucleotide is a molecule 
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according to any of claims 1 to 6 and include a 
detectable label linked to the base via a cleavable 
linker and wherein the identity of each nucleotide 
molecule incorporated is determined by detection of the 
5 label linked to the base, and said blocking group and 
said label is removed prior to introduction of the next 
complementary nucleotide. 



10. A method, according to claim 9, wherein said 
10 blocking group and said label are removed in a single 
step. 



11. A method for determining the sequence of a target 
single-stranded polynucleotide, comprising: 

15 (a) providing nucleotides according to any of 

claims 1 to 3 linked via the base to a detectable label 
via a cleavable linker and wherein the detectable label 
linked to each type of nucleotide can be distinguished 
upon detection from the detectable label used for other 

20 types of nucleotides; 

(b) incorporating the nucleotide into the 
complement of the target single stranded polynucleotide; 

(c) detecting the label of the nucleotide of (b) , 
thereby determining the type of nucleotide incorporated; 

25 (d) removing the label of the nucleotide of (b) 

and the blocking group; and 

(e) optionally repeating steps (b)-(d) one or more 
times; 

thereby determining the sequence of a target 
30 single-stranded polynucleotide. 



12. A method according to claim 11 wherein the label of 
the nucleotide and the blocking ' group are removed in a 
single chemical treatment step. 



13. A method according to claim 12 or 13, wherein each 
of the nucleotides are brought into contact with the 
target sequentially, with removal of non-incorporated 
nucleotides prior to addition of the next nucleotide, 
and wherein detection and removal of the label and the 
blocking group is carried out either after addition of 
each nucleotide, or after addition of all four 
nucleotides . 



14. A method According to claim 12 or 13, wherein each 
of the nucleotides are brought into contact with the 
target together simultaneously, and non-incorporated 
nucleotides are removed prior to detection and 
subsequent to removal of the label and the blocking 
group . 



15. A method according to claim 12 or 13, comprising a 
first step and a second step, wherein in the first step, 
•a first composition comprising two of the four 
nucleotides is brought into contact with the target and 
non-incorporated nucleotides are removed prior to 
detection and subsquent to removal of the label, and 
wherein in the second step, a second composition 
comprising the two nucleotides not included in the first 
composition is brought into contact with the target, and 
non-incorporated nucleotides are removed prior to 
detection and subsequent to removal of the label and 
blocking group, and wherein the first and second steps 
are optionally repeated one or more times. 
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16. A method according to claim 12 or 13 comprising a 
first step and a second step, wherein in the first step, 
a composition comprising one of the four nucleotides* is 
brought into contact with the target, and non- 

5 incorporated nucleotides are removed prior to detection 
and subsequent to removal of the label and blocking 
group and wherein in the second step, a second 
composition comprising the three nucleotides not 
included in the first composition is brought into 
10 contact with the target, and non-incorporated 
nucleotides are removed prior to detection and 
subsequent to removal of the label and blocking group 
and wherein the first steps and the second step are 
optionally repeated one or more times. 

15 

17. A method according to claim 15 comprising a first 
step and a second step, wherein in the first step, a 
first composition comprising three of the four 
nucleotides is brought into contact with the target, and 

20 . non-incorporated nucleotides are removed prior to 
detection and subsequent to removal of the label and 
blocking group and wherein in the second step, a 
composition comprising the nucleotide not included in 
the first composition is brought into contact with the 

25 target, and non-incorporated nucleotides are removed 
prior to detection and subsequent to removal of the 
label and blocking group and wherein the first steps and 
the second step are optionally repeated one or more 
times . 

30 18. A kit, comprising: 

(a) individual nucleotides according to any of 
claims 1 to 3; and 

(b) packaging materials therefor. 
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19. A kit according to claim 18, wherein said 
nucleotides are linked via the base to a detectable 
label by a cleavable linker, and wherein the detectable 
label can be distinguished upon detection from the 
detectable label used for any of the other three types 
of nucleotide. 



20. The kit of claim 18 or 19, further comprising an 
enzyme and buffers appropriate for the action of the 
enzyme. 
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Protected hemiacetals: 




Protected hemithioacetals: 




Protected hemiaminals: 




Fig. 1 



THIS PAG? •* - ' 



2/4 



XO' 



or 



Uridine OS-linker 




Linker^ 



•Label 



R n R 2 ^ 
Cytidine C5-linker 
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Linker^ 



Label 



NH 2 



Ri. R 2 
N7 Oeazaadenosine C7-linker 
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Linker^ 



Label 
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N7 Deazaguanostne C7Minker 



Linker—Label 



Rr R? 
.Adenosine N6-Knker 



/r^°S>K^J ' Linker*. 
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Label 



Cytidine N4-linker 



where R n and R 2 , which may be the 
same or different, are each selected 
from H. OH r or a^group which can be 
transformed-into-an i OH. Suitable groups 
for Ri and R 2 are described in Figure 4 



X = H; phosphate, diphosphate or triphosphate 



^Linker— Label 
Guanosine N2-linker 



Fig, 2 
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Label -"^Cleavable linker Base 




OX 



wheTe R! and R 2> which may be the same or 
different, are each selected from H, OH, or 
any group which can be transformed into an 
OH, including a carbonyl 



-B 



Cleavable linkers may include; 



s-j-s 

v. 





o 

.b££ 




R 3 " 



R3 represents one or more 
substituents independently 
selected from alkyl, alkoxy, 
amino or halogen 



R t and R 2 groups may include 



> 5 
R4 -s. 

R6 



\ 

R 6 



> 5 
R^F 



> 5 



where R 4 is H or alkyl, R 5 is H or 
. alkyl and R 6 is alkyl, cycloalkyl, 
alkenyl, cycloalkenyl or benzyl 

and X is H, phosphate, diphosphate or triphosphate 



Alternatively, cleavable linkers may 
be constructed from any labile 
functionality used on the 3 '-block 



Fig. 4 



